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Abstract 

We  propose  an  algorithm  for  ike  design  of  multiple- 
valued  current-mode  CMOS  logic  (CMCL)  circuits  that 
is  based  on  the  cost-table  technique.  The  algorithm  is 
a  heuristic  search  iechniquefAO*  algorithm)  [10,11]  ap¬ 
plied  to  an  AND/OR  tree.  It  is  significantly  faster  than 
Exhaustive  Search  while  providing  realizations  that  are 
almost  as  good.  A  new  cost-table  is  also  proposed  that 
results  in  better  realizations  than  obtained  with  a  previ¬ 
ous  cost-table  [14]. 

1  Introduction 

The  development  of  multiple-valued  current-mode 
CMOS  circuits  (CMCL)  [3,12,13,14]  has  resulted  in  a 
need  for  design  techniques  for  such  circuits.  The  de¬ 
velopment  of  charge- coupled  device  (CCDl  circuits  has 
inspired  a  close  examination  of  the  cost-taole  technique 
[2,8,9,14,16].  A  natural  extension  of  this  work  is  a  cost- 
table  technique  for  CMCL  [14].  In  this  paper,  we  show 
an  improved  cost-table  for  CMCL.  The  improvement  oc¬ 
curs  in  two  ways.  First,  the  table  is  extended  to  include 
more  operations.  Second,  an  Exhaustive  Search  tech¬ 
nique  is  replaced  by  an  efficient  search  technique  which 
is  similar  to  AO*  algorithm  [10,11]. 

This  paper  is  divided  as  follows;  Section  2  is  a  descrip¬ 
tion  of  notation,  Section  3  describes  the  new  cost-table 
and  decompsition  method,  Section  4  is  a  description  of 
improvements  obtained,  and  Section  5  is  the  conclu¬ 
sions. 

2  Background,  Notation  and  Basic  Cir¬ 
cuit  Elements 

Let  /2  =  {0, 1,  •  •  • ,  r- 1}  be  a  set  of  r  logic  values,  where 
r  >  2.  Let  X  =  {xi,X2,  •  •  •  be  a  set  of  n  variables, 
where  Xi  takes  on  values  from  R.  A  function  f{X)  is  a 
mapping  f  :  R.  If  A  is  a  single  variable  x,  f  (x)  is 

represented  as  an  r-tuple,  <  /(0),/(l),  • '  •  ,/(r  -  1)  >. 
For  example,  if  r  =  4,  then  /(x)  =<  3, 2, 1,0  >  repre¬ 
sents  a  complement  function  in  which  0  maps  to  3,  1  to 
2,  2  to  1,  and  3  to  0.  Let  be  the  set  of  all  r- valued 
functions  of  n  r- valued  inputs.  Let  c(/),  the  cost  of  func¬ 
tion  /,  be  a  mapping  c  :  [Jn  — ►  where  is  the 
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set  of  real  numbers.  The  cost  function  c(f)  introduced 
by  Kerkhoff  and  Robroek  [8]  for  the  design  of  4- valued 
CCD  logic  circuits  correlates  closely  with  the  chip  area 
occuppied  by  the  most  compact  implementation  of  /. 

In  the  realization  of  a  given  function  by  cost-table,  cost- 
table  functions  are  combined  using  a  connecting  oper¬ 
ation.  The  connecting  operation  -f  between  functions 
used  in  this  paper  is  similar  to  ordinary  addition  with 
logic  values  viewed  as  integers.  That  is,  if  /(A)  is  repre¬ 
sented  as  the  sum  fiX)  =  /i(A)+/2(A)-l - h/mfA), 

then,  for  any  assignment  i/  of  values  to  A,  f{t/)  = 
fiM  f  /2(»^)  +  •••-!-  /m(i'),  where  each  fi{y)  is  taken 
as  an  integer  and  where  +  is  ordinary  addition,  except 
when  the  sum  exceeds  r  —  1,  the  highest  output  logic 
value,  in  which  case  +  is  undefined.  For  example,  if 
/i(j)  =<  0,1, 2, 3  >  and  /2(x)  =<  3,2, 1,0  >,  then 
/i(xW/2(xl  =<  3, 3, 3, 3  >  and  /i(x)-K/i(x)  is  unde- 
finea.  The  nrst  example  shows  that  the  sum  of  the  iden¬ 
tity  function  <  0, 1,2,3  >  and  the  complement  function 

<  3,2, 1,0  >  is  the  constant  function  <  3,3,3,3  >. 

Let  s  be  the  cost  of  realizing  the  sum  operation  (-f) 
between  two  functions.  Thus,  the  cost  of  the  realization 
/  =  /1+/2+  •  •  •  +/m  is  c(/,)  +c{fi)  +  ■  ■  • +c(/m)+ (m  - 
l)s,  where  the  last  term  is  the  cost  of  (m  —  1)  two-input 
adders. 

Function  /  is  a  basis  function  if  and  only  if  /(A)  is  1 
for  exactly  one  assignment  of  values  to  A  and  is  0  oth¬ 
erwise.  Let  BT  be  the  union  of  all  basis  functions  and 
the  constant  0  function.  BT  is  called  the  basis  cost- 
table.  P  is  a  cosi-table  if  and  only  if  BT  C  F  C 
The  condition  BT  C  F  guarantees  that  all  functions 
can  be  realized  as  the  sum  (H-)  of  cost-table  functions. 
For  example,  in  [jJ,  BT  =  {<0,0,0, 0>,  <0,0,0, 1  >, 
<0,0,1,0>,  <0,1,0,0>,  <1,0,0,0>}.  If  <0,0,0, 1> 
is  missing,  it  is  impossible  to  realize  certain  functions, 
including  <  0,0,0, 1  >  itself.  Conversely,  any  function 

<  00,01,02,03  >  can  be  realized  as  the  sum  of  func¬ 
tions  exclusively  from  BT,  specifically  Oo  functions  of 
the  form  <  1,0, 0,0  >,  oi  <  0, 1,0,0  >,  02  <  0,0, 1,0  >, 
and  03  <  0,0,0, 1  >. 

cf(/),  the  cost  of  realizing  /  €  Un  respect  to  cost- 
table  F,  is  the  minimal  cost  reafization,  specifically 

cr(f)  =  n»«n/.,.,..,„€F{c(/i)  +  •  •  •  +  c(/m)  +  (m  -  l)s), 

where  c  is  the  cost  function.  /  =  /i 4-/2 -I - +/m  is 

said  to  be  a  minimal  cost  realization  of  /. 
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Two  cost  functions  are  of  special  interest. 

Transistor  Count 

This  cost  function  was  introduced  in  [14]  as  a  measure 
of  the  area  in  an  implementation  of  a  circuit.  Area  cor¬ 
relates  closely  with  the  number  of  transistors  needed 
in  the  minimal  circuit  implementation.  It  is  straight¬ 
forward  to  calculate  given  the  circuit,  and  we  use  it 
here.  Because  the  adder  uses  no  transistors,  its  cost,  s, 
is  0. 

Because  it  is  useful  in  predicting  cost  in  the  AO*  algo> 
rithm,  we  also  use  the  following. 

Sum  Cost 

Given  f{x)  =<  -  ••  >,  the  sum  cost  is 

SC{f{x))  =  xo  +  + - h  Xr_i. 

For  example,  SC{<  1,1, 1,1  >)  =  4  and 

SC{<  3,0,1,2>)  =  6. 

The  elements  in  the  cost-table  of  [14]  were  chosen  by 
analyzing  many  CMCL  circuits  [5,6,12,13,14,17].  We 
consider  here  two  kinds  of  logic  element  sets.  One  set 
includes  sum,  constant,  and  mirror,  which  were  used  in 
[141.  The  other  includes  threshold  detector  logic.  In 
addition  to  the  set  of  basic  elements  in  [14],  threshold 
detector  logic  in  [5,6,17]  and  their  modihcations  are  also 
used. 

•  Sum.  The  simplest  operation  in  the  current  do- 
m«n  is  arithmetic  addition.  The  sum  is  the  only 
connective  operator  in  the  set  of  basic  circuit  ele¬ 
ments.  The  circuit  realization  of  the  sum,  as  shown 
in  Fig.  1,  is  simply  an  interconnection  between  all 
the  inputs  and  the  output. 

•  Constant.  The  second  basic  element  is  the  con¬ 
stant  generator,  as  shown  in  Fig.  1.  It  can  pro¬ 
vide  any  positive  or  negative  integer  value  between 

(r  -  1)  (a  negative  logic  value)  and  (r  -  1),  where 
r  denotes  the  radix.  A  negative  logic  value,  i,  repre¬ 
sents  a  logic  value  corresponding  to  the  same  quan¬ 
tity  of  current  that  is  represented  by  i,  except  that 
the  flow  is  in  the  opposite  direction.  The  circuit  of 
a  constant  generator  consists  of  one  or  more  MOS 
N-type  transistor(8)  or  P-type  transistor(s).  The 
gate(s)  of  the  transistor(s)  are  connected  to  a  ref¬ 
erence  voltage,  which  can  be  generated  locally. 

•  N-type  and  P-type  mirror.  The  mirrors  are 
one-input,  multiple  output  logic  operators  of  which 
there  are  two  kinds,  an  N-type  and  a  P-type.  The 
mathematical  functions  bndp  (bound-positive)  and 
bndn  (bound-negative)  are  used  in  the  algeoraic 
representation  of  these  mirrors.  They  are  defined 
as; 

bndp{x)  =  max(0,x) 
bndn{x)  =  min(0,  x) 

For  example,  if  the  input  string  with  component  x 
w{S210123},  the  string  ootained  by  applying 


bndp{x)  and  5ndn(x)  i8f0000123}  and  {  £  2  1  0 
0  0  0  },  respectively.  Tne  circuits  of  these  mirrors 
are  shown  in  Fig.  1.  Multiple  copies  of  the  out¬ 
put  signals  are  Stained  by  using  separate  output 
transistors.  Multiplication  of  the  output  signals  by 
an  integer  is  obtained  by  connecting  the  mains  of 
several  identical  output  transistors  together. 

There  are  two  kinds  of  threshold  detector  circuits. 
When  input  x  is  m  or  greater,  one  produces  x  at  the 
output,  while  the  other  produces  a  constant  c.  When  x 
is  less  than  m,  both  kinds  produce  0. 

•  Threshold  detector.  A  voltage-switched  cur¬ 
rent  source  threshold  detector  can  be  implemented 
with  a  constant  generator  and  a  pass-transistor,  as 
shown  in  Fig.  1.  A  threshold  detector  with  con¬ 
stant  generator  was  designed  with  those  basic  el¬ 
ements  in  [5,17].  Fig.  2a  shows  a  circuit  that  is 
identical  except  for  the  addition  of  a  current  mirror 
to  invert  the  input  current  direction.  Considering 
Fig.  2a,  the  operation  of  the  threshold  detector  is 
as  follows:  if  the  control  input  current  x  in  the  up¬ 
per  side  of  the  box  in  the  symbol  diagram  of  Fig,  2 
is  smaller  than  or  equal  to  the  threshold  m,  the  out¬ 
put  current  is  0.  On  the  other  hand,  if  the  control 
input  current  x  is  greater  than  m,  the  gate  output  is 
connected  by  pass-transistor  T1  to  a  current  source 
T2.  In  Fig.  2b,  however,  T2  produces  the  value 
"x”.  The  function  realized  is  defined  as 

{constant  c  or  input  x 
if  m  <  X 
0  otherwise. 

The  cost  Q{f)  of  TD(m,  c)(Fig.  2, a)  is  2  +  m  + 
c.  The  2  occurs  because  there  are  3  transistors 
of  cost  1,  one  of  which  is  shared  with  other  cost- 
table  functions  (the  P-type  input  transistor).  The 
m  occurs  because  the  transistor  in  Fig.  2. a  labeled 
m  occupies  m  times  the  area  of  each  of  the  three 
transistors  discussed  above.  The  c  occurs  because 
of  the  transistor  in  Fig.  2.a  labeled  by  c.  The 
cost  of  TjD(m,  x)(Fig.  2.b^  is  5  H-  m.  There  are  six 
transistors  of  size  1  (mcluaing  T1  and  T21  of  which 
one,  the  input  transistor,  is  not  counted  oecause  it 
is  shared  with  other  cost-table  functions.  The  m 
occurs  because  of  the  transistor  labeled  m  in  Fig. 
2.b. 

•  Modified  threshold  detector.  From  the  above 
threshold  detector,  a  modified  threshold  detector, 
as  shown  in  Fig.  1,  was  used  in  [5].  This  b  defined 
as 

f  constant  c  or  input  x 
MTD{fnl,  m2,  x)  =  <  if  ml  <  x  <  m2 
(  0  otherwise. 

The  cost  of  A/T£>(ml,  m2  :  c)  b  4  +  ml  -f  m2  -f  c. 
The  4  occurs  because  of  the  2  transbtors  of  size  1 
shown  in  Fig.  1  and  2  additional  transbtors  (not 
shown)  needed  for  the  current  mirror.  Again,  one  of 
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Figure  1:  The  basic  set  of  CMCL  circuit 


these  is  not  counted  because  it  is  shared  with  other 
functions.  Similarly,  the  cost  of  AfrD(ml,m2  :  x) 
can  be  calculated  as  7+ml  +  m2.  The  operation  of 
the  modihed  threshold  detector  logic  is  same  as  the 
literal  generator  and  is  useful  in  implementing  the 
Universal  Unary  Programmable  Circuit(UUPC)  or 
T-gate,  and  Programmable  Logic  Array8(PLAs). 

•  Inverse  threshold  detector.  The  inverse  thresh¬ 
old  detector  or  down-literal  generator  operates  as 
the  inverse  operation  of  thre^old  detector.  It  can 
be  implemented  as  a  modification  of  the  threshold 
detector  and  is  defined  as 

f  constant  c  or  input  x 
lTD{m,  x)  =  <  if  m  >  X 

(  0  otherwise. 

Its  symbol  and  circuit  dianam  are  shown  in  Fig. 
1.  The  cost  Q{f)  of  /rD(m,c)  is  3  +  m  +  c  with 


constant  value  "c”  or  5+ m  for  /r£>(m,  x)  with  the 
pass-transistor. 

3  The  Proposed  Cost-table  Technique 

In  this  section,  a  cost-table  d^ign  algorithm  is  shown 
which  yields  improved  realizations  over  a  previous  algo¬ 
rithm.  The  improvements  are  due  to  insight  gained  by 
an  operation  called  Vertical  Partitioning  (VP). 

3.1  Cost-table  circuits 

The  circuit  structure  of  CMCL  cost-table  functions  con¬ 
sists  of  three  parts  ([14]);  a  distribution  circuit  for  the 
input,  circuits  that  realize  the  cost-table  functions,  and 
an  output  summator  circuit. 

Cost-table  functions  in  [14]  were  realized  using  the  cir¬ 
cuits  shown  in  Fig.  3.a  and  3.b.  We  augment  the 
cost-table  in  [14]  with  functions  that  can  be  realized 
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Figure  2:  The  operation  of  threshold  detector  logic 
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Table  1:  Cost-table  with  the  circuits  of  Fig.  3a(circuit 
a),  3b(circuit  b). 
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Figure  3:  Cost-table  circuit  structures 


using  a  multiplication.  For  examp le^  instead  of  just 

<  0,0,0, 1  >,  we  choose  to  include  <  0,0, 0,2  >  and 

<  0,0, 0,3  >  also.  Table  1  shows  the  augmented  cost- 
table.  This  increases  the  cost-table  size  from  37  to  53. 
Although  the  cost-table  size  is  increased  by  more  than 
40%,  the  algorithm  is  simplihed  using  this  cost-table. 

A  further  improvement  can  be  obtained  by  using  the 
threshold  operations  explained  in  the  previous  sec¬ 
tion.  Specifically,  the  threshold  detector  TD{m,  c)  and 
TDfm,  x).  MTD^u m2,c)  and  MTDfmi,  m2, x),  and 
/ri)(m,c)  and  /rD(m,x)  are  used.  This  results  in  an 
improvement  in  certain  costs  in  Table  1.  The  resulting 
cost-table  is  shown  in  I^ble  2.  As  with  Table  2,  this 
table  lists  the  circuit  structure  type,  the  values  of  the 
parameters,  the  cost,  and  the  values  of  the  cost  func¬ 
tions. 

In  CMCL,  the  area  of  the  circuit  correlates  closely  with 
the  number  of  transistors.  As  in  the  studies  of  CCD,  it 
is  of  interest  to  compare  the  costs  of  frinctions  obtained 


from  a  rigidly  specified  table  of  costs  to  costs  derived 
directly  from  the  functions  themselves,  that  is,  costs 
of  four  previously  defined  cost  functions  TC  [16],  TTS 
[16],  SUM  [16],  and  BC  [1].  We  do  the  same  in  this 
study. 

For  the  decomposition  program,  it  is  interesting  to  know 
how  the  transistor  counts  of  the  cost-table  functions 
and  the  mathematical  cost  functions  are  related  to  each 
other.  Table  2  also  lists  the  relations  between  the 
TTSf  SU Af,  BCf  and  TC  cost  functions  and  transistor 
count  Q  for  the  functions  in  the  cost-table  in  [14].  For 
the  given  functions,  the  mathematical  cost  functions  are 
fixed.  Therefore,  the  values  of  the  mathematical  cost 
functions  are  not  duplicated  in  Table  1. 

As  seen  in  [20],  for  all  cost-table  functions,  TTS  <  7, 
SUM  <  10,  bC  <  4  and  TC  <  2.  Even  though  we 
add  the  threshold  detector  logic  circuits  as  basic  circuit 
elements,  the  correlation  between  the  cost  Q  and  the 
mathematical  cost  functions  b  weak  as  shown  in  Table 
2.  Because  of  this  negative  correlation,  these  cost  func¬ 
tions  are  not  useful  in  a  direct  way  to  realize  a  function 
by  the  cost-table  technique.  However,  one,  the  SUM,  is 
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Table  2:  Proposed  Cost-table  for  4-valued  CM  CL  func¬ 
tions  with  the  circuits  of  Fig.  3a(circuit  a),  Sb(circuit 
b),  3c(circuit  c),  3d(circuit  d)  and  3e(circuit  e). 
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useful  indirectly,  as  we  show  later. 


3.2  The  proposed  cost-table  decomposition 
algorithm. 

The  addition  of  the  threshold  detector  logic  to  the  ba¬ 
sic  logic  in  [14]  allows  functions  to  be  generated  with 
less  cost  than  without  this  logic.  Fig.  4  shows  how  the 
various  functions  in  the  two  cost-tables  are  distributed. 
Data  points  in  boxes  indicate  number  of  functions  from 
the  old  cost-table  (Table  1),  while  data  points  in  x’s  in¬ 
dicate  number  of  functions  from  the  new  cost-table  (Tar 
ble  2).  The  lower  cost  of  the  new  cost-table  is  shown  by 
the  Rouping  of  x’s  to  the  left  of  the  boxes.  However, 
the  increase  in  the  functions  of  the  cost-table  renders 
the  direct  application  of  the  existing  decomposition  al¬ 
gorithms  difficult,  because  the  decomposition  algorithm 
seeirches  the  cost-table  at  eeu:h  step.  An  analysis  of  the 
functions  in  the  cost-table  reveals  the  following: 


Number  of  Functior^s 


Cost  of  Function 

cost-table  Table  1  “^  cost-table  Table  2 


Figure  4:  Comparison  of  the  cost  functions  in  the  cost- 
tables 


•  Constant  functions  are  the  least  expensive  func¬ 
tions. 

•  Monotonously  increasing  functions  (stair-case  func¬ 
tions)  such  as  <  0,1, 2, 3  >,  <  0,0, 1,2  >, 

<  0,0, 0,1  >,  <  0,1,2,0  >,  <  0, 1,0,0  >  and 

<  1,2, 3,0  >  can  be  obtained  at  moderate  cost. 

•  All  step-up  functions  (e.g.  <  0,0, 2, 2  >  )  and  all 
step-down  functions  (e.g.  <  2, 2,0,0  >  )  are  in  the 
table. 

•  Most  functions  like  <  a:o,xi,0,0  >  and 

<  0,0,a:2)®3  >  1'^®  cost-table,  where 

a?oj*ii*2i*3  we  any  values  in  radix  4  (i.e.  0, 

1,  2  and  3).  The  exceptions  are  <  0,0, 3, 2  >, 

<  2,3,0,0  >,  and  <  3,2,0,0  >. 

The  following  definitions  are  useful  in  fomulating  the 
decomposition  method.  We  allow  the  radix  r  to  take  on 
any  vdue,  and  so  we  consider  general  r-valued  unary 
functions  f(x)  —<  a;o>  ®xr  *  *  >  >. 

Definition  1:  Let  imax  be  the  smallest  index  i  such 
that  Xj  =  0  and  ari.fi  ^  0.  imax  is  said  to  be  the  right- 
most  internal  0  of  f(x).  If  no  Xg  is  0  or  the  only  O’s 
extend  consecutively  nom  t  =  r  —  1  to  lower  values,  the 
rightmost  internal  0  does  not  exist. 

Example:  The  functions  <  1,3,0, 1  >,  <  2, 0,0, 3  >, 
and  <  0,0,0, 1  >  all  have  a  rightmost  internal  0  at 
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t  =  2.  The  leftmost  internal  0  is  defined  in  a  8iinil2kr 
manner.  However,  we  do  not  make  use  of  it  here. 

Definition  2:  If  j  is  the  index  of  the  rightmost  internal 
0,  then  LF{x)  =  <  ••  ,0  >  is  the  left 

function  and  =  <  0,  •  •  • , 0,  •  •  • , Xr-i  > 

is  the  right  function. 

Example:  For  the  functions  <  1, 3, 0, 1  >,  <  2, 0, 0, 3  > 
and  <  0,0,0,1  >,  LF(x)  =<  1,3,0,0  >.  <  2, 0,0,0  >, 
and  <  0,0, 0,0  >,  respectively,  while  RF{x)  is 

<  0,0,0, 1  >,  <  0,0, 0,3  >,  and  <  0,0,0, 1  >,  re* 
spectively. 

These  concepts  are  useful  in  decomposing  a  given  func¬ 
tion  into  two  subfunctions.  That  is,  the  algorithm 
shown  later  accepts  a  function  /(x)  and  attempts  to 
decompose  it  into  two  subfunctions  LF{x)  and  FF(x). 

Definition  3:  /(x)  =<  c,c,  •  •  •  ,c  >  is  a  constant  func¬ 
tion  for  0  <  c  <  r  —  1. 

Example:  <  1,1, 1,1  >,  <  2, 2, 2, 2  >,  and 

<  3  ,  3  ,  3  ,  3  >  are  the  three  constant  functions  for 
4- valued  unary  functions. 

Definition  4;  /(x)  =<  xo,xi, •  •  •  ,Xr-.i  >  is  an  up- 
staircase  {down-staircase)  function  if  /(x)  is  not  a  con¬ 
stant  function  and  Xi  =  bndp{i  —  c),  for  0  <  c  <  r  —  2 

(  Xi  =  bndp{c  —  t),  for  1  <  c  <  r  —  1  ).  /(x)  is  a  stair¬ 

case  function  if  it  is  either  an  up-staircase  function  or  a 
down-staircase  function. 

Example:  <  0, 1,2,3  >,  <  0,0, 1,2  >,  and 

<  0  ,  0  ,  0  ,  I  >  are  up-staircase  functions,  while 

<  3  ,  2  ,  1  ,  0  >,  <2,1,0,0>,  and  <1,0,0,0>  are 
down-staircase  functions. 

Detinition  5:  /(x)  =<  xo,Xi, •  •  • , Xr-i  >  is  a  partial 
up-staircase  {partial  down-staircase  function)^  if  /(x)  is 
not  a  constant  function,  and  x,-  =  bndp{i  —  c)  for  0  < 
c<r-'2(xi  =  fcn(ip(c  —  »),  for  1  <  c  <  r  —  1  ),  where 
0<i<m(m<t<r—l)  and  x,-  =  0  for  m  <  t  f 
I  <  m  j,  where  m>c+l(m<c— 1).  f{x)  is  a  parti(^ 
staircase  function  if  it  is  either  a  parti^  up-st»rcase 
function  or  a  partial  down-staircase  function. 

Example:  <  0, 1,2,3  >,  <  0,0, 1,2  >,  <  0,0,0, 1  >, 

<  0, 1,0,0  >,  <  0, 1,2,0  >,  and  <  0,0, 1,0  >  are 
partial  up-staurcase  functions,  while  <  3,2, 1,0  >, 

<  2  ,  1  ,  0  ,  0  >,  <  1,0, 0,0  >,  <  0,0, 1,0  >, 

<  0  ,  2  ,  1  ,  0  >,  and  <  0, 1,0,0  >  are  partial 

down-staircase  functions. 

Definition  6:  f{x)  =<  xo,Xi,  • .  ♦ ,  Xr-i  >  is  a  block 
function  iff  x<  =  c  for  i^in  <  »  <  *mar  and  Xi  =  0 
for  t  <  imin  and  <  i,  where  1  <  c  <  r  ~  1  and 
0  ^  fmtn  ^  ^mas  ^  Ij  such  that  tmm  and  imax  are 
not  0  and  r  -- 1  simultaneously. 

Example:  When  r  =  4,  there  are  15  block  functions 

<  0, 1,1,1  >,  <  0,2,2, 2  >,  <  0,3,3,3  >,  <  0, 0, 1,1  >, 

<  0,0,2, 2  >,  <  0,0, 3, 3  >,  <  0, 1,1,0  >,  <  0,2, 2,0  >, 

<  0,3, 3,0  >,  <  1, 1,0,0  >,  <  2,2, 0,0  >,  <  3, 3, 0,0  >, 

<  1,1,1,0  >,  <  2,2,2,0  >,  and  <  3, 3, 3,0  >. 

In  the  algorithm  described  next,  a  given  function  /(x) 
is  tested  to  determine  if  a  nonzero  constant  function 
can  be  subtracted  from  it.  Next,  an  internal  0  b  sought 
and  the  corresponding  right  and  left  functions  FF^x) 


and  LF(x)  are  extracted.  These  are  then  decomposed 
further. 

Thb  process  is  performed  using  an  AND/OR  tree.  From 
certain  nodes,  there  are  AND  arcs  which  lead  to  two 
or  more  nodes,  all  of  which  must  be  solved  to  solve 
the  ^ven  node.  Alternatively,  if  a  node  has  OR  arcs 
leading  to  two  or  more  nodes,  only  one  of  those  nodes 
must  be  solved  in  order  for  the  initial  node  to  be  solved. 
Thb  algorithmic  process  is  similar  to  the  well-known  A* 
algorithm  [4].  But  specifically,  we  use  the  tree  structure 
of  the  AO*  algorithm  [10,11]  to  find  solutions  in  the 
AND_OR  tree,  and  our  terminology  b  similar  to  that 
of  [15].  The  algorithm  described  here  finds  a  path  from 
the  starting  node  to  a  set  of  nodes  representing  the  cost- 
table  functions. 

The  proposed  Vertical  Partitioning  algorithm  (VP)  pro¬ 
ceeds  by  searching  an  AND.OR  tree  [15].  In  this  tree, 
nodes  represent  functions.  As  the  search  proceeds,  cost 
estimates  are  generated  at  each  node.  These  estimates 
are  updated  as  more  information  is  gathered  in  the 
search  and,  in  turn,  are  used  to  direct  the  search  to¬ 
ward  productive  paths. 

Vertical  Partitioning  Algorithm 

1.  Let  the  initial  AND.OR  tree  consist  only  of  the 
node  representing  /  =  /(x),  which  is  called  IN  IT. 
Set  this  node’s  cost,  Q{f),  to  oo,  and  denote  this 
level  as  0. 

2.  Generate  levels  1,  2,  ...and  thus  the  AND.OR 
tree(AO-tree),  until  all  leaves  become  a  cost-table 
function  or  FUTILITY,  The  AO-tree  has  AND 
node(s)  on  leveb  of  odd  value  and  has  OR  node(s) 
on  leveb  of  even  value.  The  algorithm  proceeds  un¬ 
til  INIT  b  labeled  SOLVED.  Repeat  the  following 
operation  at  each  level  until  all  leaves  generated  are 
marked  SOLVED  or  FUTILITY: 

a.  At  each  AND  node  function  /,  i.e.,  level  0, 
2,  4,  ...  ,  let  z  be  the  rightmost  internal  0. 
If  z  exists,  partition  /  into  two  subfunctions 
/i  and  /2,  where  fi  is  the  left  function(LF) 
and  f2  b  the  right  functionfRF)  of  /.  If  z 
does  not  exist,  let  /i  =  /,  ana  mark  /2  as  FU¬ 
TILITY,  which  b  equivalent  to  specifying  the 
AND  node  as  not  partitioned.  Form  the  AO- 
tree  nodes  /i  and  /g.  If  /i  or  /2  is  in  the  cost- 
table,  mark  the  corresponding  node  SOLVED, 
and  assign  it  the  Q  value  of  the  function’s  cost 
from  the  cost-table.  If  fi  or  /2  b  not  in  the 
cost-table,  do  the  following: 

b.  At  each  OR  node  function,  f,  level  1,  3,  5, 

...  ,  generate  a  component  function  f  of 

/  of  the  following  three  types  1.  a  partial 
staircase  function(assigned  to  the  left-hand 
ORjBuccessor),  2.  a  mock  function(assigned 
to  the  middle  OR-successor),  and  3.  a  non¬ 
zero  constant  functionf^igned  to  the  right- 
hand  ORjBuccessor).  Each  component  func¬ 
tion  /  has  the  property  that  1.  /'is  in  the 
cost-table,  2.  /'(x)  <  /(x)  for  0  <  x  <  r  -  1 
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in  radix  r,  and  3.  there  is  no  other  compo¬ 
nent  function  of  the  some  type  with  a  larger 
SUM  cost  function.  Furthermore ,  among  the 
choices  of  component  functions  with  the  same 
Isugest  SUM  cost,  select  the  one  with  the  low¬ 
est  Q( cost-table  cost).  If  no  function  of  any 
of  the  three  types  of  functions  can  be  sub¬ 
tracted,  the  corresponding  ORjsuccessor  node 
is  marked  FULTILITY.  Next, 

i.  choose  the  arc  associated  with  a  compo¬ 
nent  function,  /',  that  has  the  lowest  Q 
value. 

ii.  generate  the  ORjsuccessor  by  subtract¬ 
ing  the  corresponding  component  func¬ 
tion  from  the  OR  node  function. 

iii.  if  the  ORjsuccessor  is  found  in  the  cost- 
table,  label  it  SOLVED  and  assign  its  Q 
value  from  the  cost-table. 

iv.  depth  first  recursion:  If  an  OR-successor 
cannot  be  found  in  the  cost-table,  re¬ 
peat  the  above  AND/OR  decomposition 
until  all  successive  leaves  are  marked  as 
SOLVED. 

V.  backtrack:  When  a  node  status  changes 
to  SOLVED  and  its  cost  changes,  its  par¬ 
ent  node  cost  is  re-evaluated  to  determine 
if  a  lower  cost  can  be  assigned  to  the  par¬ 
ent  node. 

vi.  repeat  Steps  (ii)  to  (iv)  for  other 
OR^uccessors  it  the  Q  values  for  their 
component  functions  are  less  than  the 
cost  of  the  OR  node  found  from 
the  completed  evaluation  of  the  first 
ORjsuccessor. 

3.  Besides  the  decompositions  considered  in  Steps  1 
and  2  above,  which  are  based  on  vertical  partitions 
about  the  leftmost  internal  0,  consider  other  ver¬ 
tical  partitions.  Compute  the  cost  of  all  vertical 
partitions  of  the  given  function,  as  it  is  decomposed 
into  subfuctions  <  Xq  i  ®i  ,  *  *  •  ,  ,  0  ,  •  •  •  ,  0  > 

and  <  0  ,  •••  ,  0  ,  Xj+i  ,  •••  ,  >)  where 

both  subfunctions  are  in  the  cost-table(if  either  one 
or  both  are  not  in  the  cost-table,  discard  the  com¬ 
position).  There  are  r  —  1  such  decompositions. 
From  among  these  decompositions  and  the  decom¬ 
positions  chosen  in  Steps  2  and  3,  choose  the  one 
with  the  lowest  cost. 


3.3  Examples 

Fig.  5  shows  two  examples  of  the  Vertical  Partitioning 
algorithm. 

Example:  Fig.  5a  shows  how  <  3, 0,2,3  >  is  de¬ 
composed  with  the  cost-table  shown  in  Table  1.  At 
level  0,  this  function  is  partitioned  into  <  3, 0,0,0  > 
and  <  0,0, 2, 3  >.  Function  <  3, 0,0,0  >  is  in  the 
cost-table  and  is  therefore  marked  SOLVED,  with  a 
cost  Q  =  8.  Since  the  other  function  <  0,0, 2, 3  > 
is  not  in  Table  1,  it  is  decomposed  as  <  0,0,1,!  > 
or  <  0,0,0, 1  >  by  subtracting  the  staircase  function 


f  =<  0,0, 1,2  >,  with  a  cost  Q{f)  =  4  or  the  block 
function  /'  =<  0,0, 2, 2  >,  with  a  cost  Q(/')  =  10. 
Because  it  is  not  possible  to  subtract  a  non-zero  con¬ 
stant,  the  corresponding  node  is  marked  FUTILITY. 
Both  <  0, 0, 1, 1  >  and  <  0, 0, 0, 1  >  are  cost-table  func¬ 
tions  and  the  minimal  realization  can  be  obtained  as 

<  3, 0,0,0  >  -h  <  0,0, 1,2  >  -f-  <  0,0, 1,1  >  with  a 
toted  cost  of  21. 

Example;  Fig.  5b  shows  the  realization  of  <  3, 2, 1, 1  > 
with  the  cost-table  shown  in  Table  2.  Since  <  3, 2, 1, 1  > 
contains  no  leftmost  internal  0,  there  is  no  partition 
at  level  0.  Thus,  the  left  function  is  marked  FUTIL¬ 
ITY  and  consequently  the  right  function  is  a  copy  of 
parent  node  function.  Next  generate  three  potential 
component  functions  <  1,1,1, 1  >(constant  function), 

<  2,2,0,0  >  (block  function)  and  <  3,2, 1,0  >  (stair¬ 
case  function)  with  costs  1,  7.5  and  8,  respectively. 
First,  <  3,2, 1,1  >  is  decomposed  as  <  1,1, 1,1  >  + 

<  2, 1,0,0  >  at  the  right-hand  ORjsuccessor,  since  its 
cost  is  the  lowest  among  the  component  functions  at  its 
tree  level.  Node  <  2, 1,0,0  >  is  marked  SOLVED  be¬ 
cause  it  is  a  cost-table  function  at  a  cost  0  =  7.  At 
this  point,  one  solution  is  found  with  the  total  cost  8 
by  summing  Q{f*)  =  1  with  0  =  7.  Next,  test  the 
next  lowest  component,  i.e.,  the  middle  OR^uccessor. 
Here,  OR  node  function  <  3,2, 1,1  >  can  be  decom¬ 
posed  as  <  2, 2, 0,0  >  -f  <  1,0, 1,1  >  ,  the  latter 
of  which  can  further  be  partitioned  into  <  1, 0,0,0  > 
and  <  0,0, 1,1  >  (since  <  1,0, 1,1  >  is  not  a  cost- 
table  function).  Both  <  1,0, 0,0  >  and  <  0,0, 1,1  > 
are  cost-table  functions  and  are  marked  SOLVED  with 
cost  Q  =  5.5  and  Q  =  4.5,  respectively.  Now  that 
the  middle  OR-successor  is  solved.  The  total  cost  for 
this  path  is  calculated  by  summing  the  cost  Q  for 

<  1,0, 0,0  >,  <  0,0, 1,1  >,  and  <  1,0, 1,1  >  for  a 
total  cost  larger  than  the  solution  already  solved  for  the 
right-hand  ORjsuccessor.  Therefore,  the  solution  for 
the  middle  ORjsuccessor  is  discarded.  The  remaining 
left-hand  OR^uccessor  can  also  be  discarded,  since  its 
total  cost  will  surely  be  greater  than  cost  of  the  com¬ 
ponent  function,  <  3,2, 1,0  >,  which  is  8.  Thus,  the 
solution  is  <  1,1, 1,1  >  +  <  2, 1,0,0  >  with  a  cost 
Q  =  8. 

Example:  In  [14],  <  1,3,0, 3  >  is  realized  at  a  cost 
Q  =  24  using  <  1,0, 0,0  >  and  <  0,0, 1,0  >  with  a 
multiplication  factor  of  2  and  3  respectively  followed  by 
complementation  with  respect  to  a  constant  3.  With  the 
cost-table  shown  in  Table  1  presented  here,  it  is  realized 
at  a  cost  of  Q  =  20  as  shown  in  Fig.  6. 

Example:  Using  the  cost-table  of  Table  2,  the  Vertical 
Partitioning  algorithm  decomposes  <  0,2, 3, 2  >  into 

<  0,2, 2, 2  >  and  <  0,0, 1,0  >  at  a  cost  of  Q  =  13.5. 
However,  the  Exhaustive  Search  algorithm  decomposes 

<  0,2, 3, 2  >  into  <  0,1, 1,1  >  and  <  0, 1,2, 1  >  at  a 
cost  of  12.5,  which  is  better  than  that  produced  by  the 
Vertical  Partitioning  algorithm. 

4  Overall  Comparison  Results 

An  example  in  the  previous  section  shows  that  the  re¬ 
sults  of  the  Vertical  Partitioning  are  sometimes  worse 
than  Exhaustive  Search.  In  this  section,  we  further 
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<3  0  2  3  > 
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<3  2  1  1> 


FUTILITlf  <3  2  I  i> 


b) 


Figure  5:  Examples  of  the  application  of  the  vertical 
partitioning  algorithm 


quantify  the  performance  of  the  two  algorithms.  Fig. 
7  shows  the  result  of  applying  the  Vertical  Partitioning 
algorithm  to  255  4-valued  unary  functions  using  two 
cost-tables,  that  of  Table  1  and  Table  2.  Plotted  verti¬ 
cally  is  the  number  of  functions  with  a  minimum  cost  of 
realization  shown  along  the  horizontal  axis.  Data  points 
in  boxes  represent  the  cost-table  of  Table  1,  while  data 
points  marked  by  x’s  correspond  to  the  cost- table  shown 
in  Thble  2.  For  example,  the  box  at  (4,1)  means  there 
is  one  function  realized  at  a  cost  of  4  using  the  Vertical 
Partitioning  algorithm  on  the  cost-table  of  Table  1 .  The 
advantage  of  the  cost-table  of  Table  2  is  clearly  seen  as  a 
bulk  of  the  new  cost-table  histogram  is  to  the  left  of  the 
bulk  of  the  old  cost-table  histogram.  The  average  cost 
over  all  255  functions  is  Q=: 12. 1804  with  the  cost-table 
of  Table  2  compared  to  Q= 13.6039  with  the  cost-table 
of  Table  1.  A  similar  experiment  was  performed  using 
the  Exhaustive  Search  algorithm  on  the  cost-table  of 
Table  2.  The  cost- tables  of  Table  2  and  Table  1  yielded 
an  average  cost  of  Q=12.1235  and  Q=13.5686,  respec¬ 
tively,  Table  3  shows  the  computation  time  of  Vertical 
Partitioning  algorithm  as  it  compares  to  the  Exhaus¬ 
tive  Search.  That  is,  the  Vertical  Partitioning  algo¬ 
rithm  required  19.3  sec.  of  CPU  time  to  compute  the 
minirnal  realization  of  all  255  4-valued  unary  functions. 
This  is  five  times  faster  than  EX(3)  which  is  Exhaus¬ 
tive  Search  for  the  minimal  solution  over  all  combina¬ 
tions  of  three  cost-table  functions.  It  is  265  times  faster 
than  EX(4)  which  is  Exhaustive  Search  over  all  combi- 


Figure  6:  f{x)  =<  1,3,0,3  > 


Table  3:  Relative  Computation  Times  of  Three  Algo¬ 
rithms 


Algorithm 

CPU  Time  (sec.) 

Relative  Duration 

VP 

19.J 

1  ■ 

EX(3) 

96.6 

»  5 

exN) 

s»  53  X  EX(3) 

^265 

nations  of  four  cost-table  functions.  We  have  verified 
that  the  cost-table  implementation  EX(3)  is  the  same 
as  Exhaustive  Search,  These  observations  hold  because 
of  the  increased  size  of  the  cost-table  to  53.  As  a  result, 
we  didn’t  need  to  run  EX(4).  Therefore,  the  CPU  time 
for  EX (4)  is  an  estimate  in  Table  3. 

5  Conclusions 

We  have  proposed  the  Vertical  Partitioning  algorithm 
for  the  design  of  current-mode  CMOS  multiple-valued 
logic.  It  partitions  a  given  function  according  to  the 
location  of  logic  O’s.  The  algorithm  requires  search 
and  proceeds  in  a  manner  similar  to  the  heuristic 
algorithm(Ap*).  In  addition,  the  cost-table  of  Table 
2  is  proposed  in  which  threshold  detector  logic  circuits 
augment  operations  used  in  a  previous  cost- table.  The 
cost  of  realizing  functions  using  the  cost-table  of  Ta¬ 
ble  2  is  about  10%  less  than  with  Table  1.  That  is, 
using  the  Vertical  Partitioning  algorithm,  the  average 
cost  over  255  4-valued  functions  was  Q=12.1804  for  the 
cost-table  of  Table  2  verses  Q=13.6039  for  the  cost-table 
of  Thble  1,  a  reduction  of  11.7%.  The  new  algorithm 
does  not  achieve  the  performance  of  Exhaustive  Search 
which  finds  a  guaranteed  minimal  solution.  However, 
it  is  not  far  off,  only  0.0047%(calculated  from  the  total 
cost  difference  between  Vertical  Partitioning  algorithm 
and  Exhaustive  Search  divided  by  the  totw  cost  from 
Exhaustive  Search  for  all  4-valued  255  functions)  and  it 
is  faster. 

An  advantage  of  the  Vertical  Partitioning  method  is 
that  it  is  easily  extended  to  higher  radices  in  a  nat¬ 
ural  way.  We  expect  this,  because  functions  like  the 
constant  ard  staircase  will  be  relatively  inexpensive  in 
higher  radices  as  well. 
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Number  of  Functions 


Cost  Of  Function 


with  Table  1  with  Table  2 


Figure  7:  Comparison  of  255  4- valued  unary  functions 
in  two  cost-tables 
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